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STATISTICAL ASPECTS OF THE FRACTIONAL 
STOCHASTIC CALCULUS 
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Universite de Paris 1 Pantheon-Sorbonne and Purdue University 

We apply the techniques of stochastic integration with respect to 
fractional Brownian motion and the theory of regularity and supre- 
mum estimation for stochastic processes to study the maximum likeli- 
hood estimator (MLE) for the drift parameter of stochastic processes 
satisfying stochastic equations driven by a fractional Brownian mo- 
tion with any level of Holder-regularity (any Hurst parameter). We 
prove existence and strong consistency of the MLE for linear and 
nonlinear equations. We also prove that a version of the MLE using 
only discrete observations is still a strongly consistent estimator. 

1. Introduction. Stochastic calculus with respect to fractional Brownian 
motion (fBm) has recently experienced intensive development, motivated by 
the wide array of applications of this family of stochastic processes. For ex- 
ample, recent work and empirical studies have shown that traffic in modern 
packet-based high-speed networks frequently exhibits fractal behavior over a 
wide range of time scales; in quantitative finance and econometrics, the frac- 
tional Black-Scholes model has recently been introduced (see, e.g., [14, 17]) 
and this motivates the statistical study of stochastic differential equations 
governed by fBm. 

The topic of parameter estimation for stochastic differential equations 
driven by standard Brownian motion is of course not new. Diffusion pro- 
cesses are widely used for modeling continuous time phenomena; therefore, 
statistical inference for diffusion processes has been an active research area 
over the last few decades. When the whole trajectory of the diffusion can be 
observed, then the parameter estimation problem is relatively simple, but of 
practical contemporary interest is work in which an approximate estimator, 
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using only information gleaned from the underlying process in discrete time, 
is able to do as well as an estimator that uses continuously gathered infor- 
mation. Several methods have been employed to construct good estimators 
for this challenging question of discretely observed diffusions; among these 
methods, we refer to numerical approximation of the likelihood function (see 
[1, 5, 32]), martingale estimating functions (see [6]), indirect statistical in- 
ference (see [16]), the Bayesian approach (see [15]), some sharp probabilistic 
bounds on the convergence of estimators in [7], and [10, 12, 31] for particular 
situations. We mention the survey [36] for parameter estimation in discrete 
cases, further details in [21, 25] and the book [23]. 

Parameter estimation questions for stochastic differential equations driven 
by fBm are, in contrast, in their infancy. Some of the main contributions 
include [18, 19, 20, 33]. We take up these estimation questions in this article. 
Our purpose is to contribute further to the study of the statistical aspects 
of the fractional stochastic calculus by introducing the systematic use of 
efficient tools from stochastic analysis, to yield results which hold in some 
nonlinear generality. We consider the stochastic equation 



where B H is a fBm with Hurst parameter H £ (0,1) and the nonlinear func- 
tion b satisfies some regularity and nondegeneracy conditions. We estimate 
the parameter 9 on the basis of the observation of the trajectory of the pro- 
cess X. The parameter H, which is assumed to be known, characterizes the 
local behavior of the process, with Holder-regularity increasing with H; if 
H = 1/2, fBm is standard Brownian motion (BM) and thus has indepen- 
dent increments; if H > 1/2, the increments of fBm are positively correlated 
and the process is more regular than BM; if H < 1/2, the increments are 
negatively correlated and the process is less regular than BM. H also char- 
acterizes the speed of decay of the correlation between distant increments. 
Estimating long-range dependence parameters is a difficult problem in itself 
which has received various levels of attention depending on the context; the 
text [3] can be consulted for an overview of the question; we have found 
the yet unpublished work [11], available online, which appears to propose 
a good solution applicable directly to fBm. Herein, we do not address the 
Hurst parameter estimation issue. 

The results we prove in this paper are as follows: for every H in (0, 1), 

• we give concrete assumptions on the nonlinear coefficient b to ensure ex- 
istence of the maximum likelihood estimator (MLE) for (Proposition 



• under certain hypotheses on b which include nonlinear classes, we prove 
the strong consistency of the MLE (Theorems 2 and 3, depending on 
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whether H < 1/2 or H > 1/2; and Proposition 2 and Lemma 3 for the 
scope of nonlinear applicability of these theorems); note that for H > 1/2 
and b linear, this has also been proved in [18]; 

• the bias and mean-square error for the MLE are estimated in the linear 
case (Proposition 3); this result was established for H > 1/2 in [18]. 

In this paper we also present a first practical implementation of the MLE 
studied herein, using only discrete observations of the solution X of equation 
(1), by replacing integrals with their Riemann sum approximations. We show 
that 

• in linear and some nonlinear cases, the discretization time-step for the 
Riemann sum approximations of the MLE can be fixed while still allowing 
for a strongly consistent estimator (Proposition 5 and Theorem 4). 

To establish all these results, we use techniques in stochastic analysis 
including the Malliavin calculus, and supremum estimations for stochastic 
processes. The Malliavin calculus, or the stochastic calculus of variations, 
was introduced by P. Malliavin in [27] and developed by D. Nualart in [29]. 
Its original purpose was to study the existence and the regularity of the 
density of solutions to stochastic differential equations. Since our hypotheses 
in the present paper to ensure existence and strong consistency of the MLE 
are given in terms of certain densities [see condition (C)], the techniques of 
the Malliavin calculus appear as a natural tool. 

We believe our paper is the first instance where the Malliavin calculus and 
supremum estimations are used to treat parameter estimation questions for 
fractional stochastic equations. These techniques should have applications 
and implications in statistics and probability reaching beyond the question of 
MLE for fBm. Indeed, even in the case of (ltd-) diffusion models, the strong 
consistency of an estimator follows if one can prove that an expression of 
the type It ■= Jq f 2 (X s ) ds tends to oo as t — ► oo almost surely, but a limited 
number of methods has been employed to deal with this kind of problem (if 
X is Gaussian, the Laplace transform can be computed explicitly to show 
that lim^oo/t = oo a.s.; if X is an ergodic diffusion, a local time argument 
can be used; particular situations have also been considered in [21, 22]). Our 
stochastic analytic tools constitute a new possibility, judging by the fact that 
the case of H < 1/2 is well within the reach of our tools, in contrast to the 
other above-mentioned methods, as employed, in particular, in [18] (see, 
however, a general Bayesian-type problem discussed in [19]). 

The organization of our paper is as follows. Section 2 contains prelimi- 
naries on the fBm. In Section 3 we show the existence of the MLE for the 
parameter 9 in (7) and in Section 4 we study its asymptotic behavior. Sec- 
tion 5 contains some additional results in the case when the drift function is 
linear. In Section 6 a discretized version of the MLE is studied. Some crucial 
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technical proofs appear in the Appendix, while other more minor ones can 
be found in an extended version of this article posted on arXiv.org. 

2. Preliminaries on fBm and fractional calculus. We consider (-BfOtelo.T] > 
Bq = 0, a fBm with Hurst parameter H E (0,1), in a probability space 
(£l,!F, P), that is, a centered Gaussian process with covariance function R 
given by 

(2) R(t, s) = E[Bf Bf ] = \ (t 2H + s 2H -\t- s \ 2H ), s,t& [0,T]. 
Let us denote by K the kernel of the fBm such that (see, e.g., [28]) 

(3) Bf = f K(t,s)dW s , 

Jo 

where W is a Wiener process (standard Brownian motion) under P. Denote 
by £h the set of step functions on [0, T] and let 7i be the canonical Hilbert 
space of the fBm; that is, 7i is the closure of £ with respect to the scalar 
product (l[o,t]i l[o,s]}« = R(t,s). The mapping lr o t i — > Bf- can be extended 
to an isometry between Ti and the Gaussian space generated by B and we 
denote by B H ((p) the image of ip £ 7i by this isometry. 

We also introduce the operator K* from £h to L 2 ([0,T]) defined by 

(4) (#V)(*) = ^(r, s)v(a) + /^(O " ^ s ))^§r^ s ) dr- 

With this notation, we have (K*l^^)(s) = K(t,s) and hence, the process 

(5) W t = j\K*>-H m ){s)dBf 

is a Wiener process (see [2]); in fact, it is the Wiener process referred to in for- 
mula (3), and for any nonrandom ip G "H, we have B H (ip) = Jq (K*cp)(s) dW{s), 
where the latter is a standard Wiener integral with respect to W . 

Last, we will use some elements of fractional calculus, with notation 
that can be found in [30] or our arXiv extended version, including the 
Riemann-Liouville fractional integral Io + f(t) and derivative Dq, f(t) of 
any / £ L 1 [0,T] for a > 0. The linear isomorphism Kh from L 2 ([0,T]) onto 
/+ +1/2 (L 2 ([0,T])) whose kernel is K(t,s) has an inverse which, depending 
on whether H < 1/2 or H > 1/2, equals 

(K^h)(s) = s H -^lii 2 - H (s^- H h'(s))(s), or 

(6) 

= s H - 1 / 2 D^- l/2 (s 1 / 2 ~ H h'(s))(s). 
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3. The maximum likelihood estimator for fBm-driven stochastic differen- 
tial equations. We will analyze the estimation of the parameter # 6 C R 
based on the observation of the solution X of the stochastic differential 
equation 



(7) X t = 9 I b(X 8 )ds + B?, x = o, 

Jo 



lo 

where B H is a fBm with H € (0, 1) and b : R — > R is a measurable function. 
There are some strong known results concerning equation (7). In [30] strong 
existence and uniqueness is proved assuming only the linear growth 

|6(s)|<C(l + |a:|) 

for b when H < 1/2, and assuming Holder-continuity of order a € (1— Jjj, 1) 
when H > 1/2. An extension is obtained in [8] when H > 1/2 if one adds 
a bounded nondecreasing left (or right) continuous function to the Holder- 
continuous function b. In this paper we will only be using Lipshitz-continuous 
functions b. 

Throughout the paper, we will typically avoid the use of explicit in- 
dependent constants appearing in the definitions of the operator kernels 
related to this calculus, since our main interest consists of asymptotic proper- 
ties for estimators. In consequence, we will use the notation C '(H) , c(H) , ch , ■ ■ 
for generic constants depending on H, which may change from line to line. 

Our MLE construction is based on the following observation (sec [30]). 
Consider the process = Bf + J^Usds, where the process u is adapted 
and with integrable paths. Then we can write 

(8) Bf= f t K(t,s)dZ s , 

Jo 

where 

(9) Z t = W t + J* K H X Qf u r dr^j (s) ds. 
We have the following Girsanov theorem. 

Theorem 1. (i) Assume that u is an adapted process with integrable 
paths such that 

f u s dsG/£. +1/2 (L 2 ([0,T])) a.s. 
Jo 

(ii) Assume that E(Vx) = 1, where 

V T = exp (- [ T K H l ( [ u r dr) (s) dW s 

(10) V Jo Wo 7 

1 / / f-1 

2 
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Then under the probability measure P defined by dP/dP = Vp, it holds that 
the process Z defined in (9) is a Brownian motion and the process B (8) 
is a fractional Brownian motion on [0,T\. 

Hypothesis. We need to make, at this stage and throughout the re- 
mainder of the paper, the following assumption on the drift: b is differen- 
tiable with bounded derivative b'; thus, the affine growth condition holds. 

This Girsanov theorem is the basis for the following expression of the 
MLE. 

Proposition 1. Denote, for every t G \0,T], 

(11) Q t = Q t (X) = Kjj 1 (J b(X r ) dr^j (t). 

Then Q £ L 2 ([0,T]) almost surely and the MLE is given by 

. _ J*Q s dW s 



(12) 



So Q 2 S ds 

Before proving Proposition 1, we need the following estimates. 
Lemma 1. For every s,t£ [0,T], 



(13) sup \X S \<( COt + sup |J3f | e cet 

s<t V s<t ) 

and 

(14) \X t - X s \ < Ce(l + sup |X u |V-s| + |Bf -Bf|. 

V u<T J 

Proof. With C the linear growth constant of b, we have, for any s, 
\X S \ < [ S \6\\b(X u )\du + \B*\<C8 [\l + \X u \)du + sup\B*\ 

JO JO u<s 

and by Gronwall's lemma, 

\X S \< (ces + sup\B^\)e C9s , se[0,T], 

\ u<s J 

and the estimate (13) follows. The second estimate follows by 6's affine 
growth. □ 

Proof of Proposition 1. Let 

ft 



h(t)= f b{X s )ds. 
Jo 
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We prove that the process h satisfies (i) and (ii) of Theorem 1. Note first 
that the application of the operator Kjj 1 preserves the adaptability. We 
treat separately the cases when H is greater than 1/2 and less than 1/2 . 

The case H < 1 /2. To prove (i), we only need to show that Q € L 2 ([0, T]) 
P-a.s. Now using relation (6) we thus have, for some constant Ch which 
may change from line to line, using the hypothesis \b(x)\ < C(l + \x\), for 
all s < T, 



(15) 



\Qs\<C H s H ~^ f 
Jo 



-1/2-H..1/2-H 



u L '^ H b(X u )du 



<C H ( l + sup|Ag J, 

u<s 



which we can rewrite, thanks to Lemma 1, as 

sup|Q s |<C(i7,T)(l + sup|X(s) 



s<T 



s<T 



which, thanks to inequality (13), is of course much stronger than Q G L 2 ([0, T]) 
a.s. 

To prove (ii), it suffices to show that there exists a constant a > such 
that 

supE(exp(a(5s)) < oo. 

s<T 

Since Q satisfies (15), the above exponential moment is a trivial consequence 
of inequality (13) and Fernique's theorem on the exponential integrability 
of the square of a seminorm of a Gaussian process. 



The case H > 1/2. Using formula (6), we have in this case that 



Qs 



CH 



A/2-H 



b(X s ) 



1 



+ [H- L -\s H -^ 



b{X s )s 1 ' 2 - H - bjX^u 1 ' 2 - 11 
{s - u) H +V 2 



du 



(16) 



CH 



M2-H 



b(X s 



1 



+ [H- 1 -)s H ^ 2 b{X s 



{s-u)^ 1 / 2 
b(X s )-b(X u )^ 1/2 _ H 



du 



{s-u) H +V 2 



■11 



du 
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and using the fact that 

1/2-H _ u l/2-H^ s _ ,A-H-l/2 A| = r(mfl l-2H 







){s- U y H ~ 1/2 du = c(H)s 1 - 



we get 

\Q.\ < c H (sV*-»\ b (X s )\ + S -V 2 /J b [^-^ u^ du 



:=A{s) + B(s). 

The first term A(s) above can be treated as in [30], proof of Theorem 3, 
due to our Lipschitz assumption on b. We obtain that, for every A > 1, 

(17) E (^exp (x J* A 2 s ds\\ < oo. 

To obtain the same conclusion for the second summand B(s), we note that, 
by Lemma 1, up to a multiplicative constant, the random variable 

\X S — X u \ 

G = sup 



0<u<s<T \U ~ S\ H 6 

is bounded by 

\B H -B H \ 



1 + sup \X U \ I \t - + sup 

u<T J 0<u<s<T \U ~ S\ n £ 

and it suffices to use the calculations contained in [30]. 

Conclusion. Properties (i) and (ii) are established for both cases of H, 
and we may apply Theorem 1. Expression (12) for the MLE follows a stan- 
dard calculation, since [using the notation Pq for the probability measure 
induced by (X s )o< s <t and the fact that Pq = P] 



dP Z* 1 2 ft 

(18) F(0) := log -± = -9 / Q s dW s - - / Q 2 s ds. u 

aP Jo 2 Jo L - 1 

We finish this section with some remarks that will relate our construction 
to previous work [18, 19, 33]. Details about these links are given in Section 
5. 

Alternative form of the MLE. By (7), we can write, by integrating the 
quantity K*'~ 1 l[o,t]{s) for s between and t, 

(19) ^(ir*'- 1 l M (-))(s)dX s = 0^(ir*'- 1 l [ o !t] (-))(s)6(^)^ + ^- 
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On the other hand, by (7) again, 

(20) X t = f K(t,s)dZ s , 

Jo 

where Z is given by (9). Therefore, we have the equality 

(21) j\K*>- x l m (-))(s)dX s = Z t . 
By combining (19) and (21), we obtain 

J K H X (J b(X r ) dr) (s) ds = j\K*^l m {-)){s)b{X s ) ds 

and thus, the function f t (K*'~ 1 l[ ^(-))(s)b(X s ) ds is absolutely contin- 
uous with respect to the Lebesgue measure and 

(22) Q t = j t j\K*>- 1 l M (-)){s)b(X 8 )ds. 
By (9), we get that the function (18) can be written as 



F{9) = -6 f t Q s dZ s + ^ fQlds. 
JO 2 Jo 



As a consequence, the maximum likelihood estimator 9t has the equivalent 
form 

/ 9 o\ a _ Jo Qs dZ s 

J Qjds 

The above formula (23) shows explicitly that the estimator 9t is observable 
if we observe the whole trajectory of the solution X . 

4. Asymptotic behavior of the maximum likelihood estimator. This sec- 
tion is devoted to studying the strong consistency of the MLE (12). A similar 
result has been proven in the case b(x) = x and H > 1/2 in [18]. We propose 
here a proof of strong consistency for a class of functions b which contains 
significant nonlinear examples. By replacing (9) in (23), we obtain that 

dt e _ tiQsdW s ^ 
IoQ 2 s ds 

with Q given by (11) or (22). To prove that 9t — ► 9 almost surely as t — > oo 
(which means by definition that the estimator 0% is strongly consistent), by 
the strong law of large numbers we need only show that 

rt 

a.s. 



(24) lim / Q 2 s ds = oo 

t->°o Jq 
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To prove that lim^oo Jq Q 2 s ds = oo in a nonlinear case, it is necessary to 
make some assumption of nondegeneracy on the behavior of b. In order to 
illustrate our method using the least amount of technicality, we will restrict 
our study to the case where the function \b\ satisfies a simple probabilistic 
estimate with respect to fractional Brownian motion: 

(C) There exist positive constants to and Kf,, both depending only on H 
and the function b, and a constant 7 < 1/(1 + H) such that, for all t>to 
and all e > 0, we have P[\Qt(u))\/vi < s] < et lH K^, where under P, uj 
has the law of fractional Brownian motion with parameter H. 

4.1. The case H < 1/2. In this part we prove the following result. 

Theorem 2. Assume that H < 1/2 and that condition (C) holds. Then 
the estimator 9t is strongly consistent, that is, 

lim 9t = 6 almost surely. 

t—KX> 

Before proving this theorem, we discuss condition (C). To understand this 
condition, we first note that, with fJ H the positive measure on [0,t] defined 
by /Lt^-(dr) = (r /t) l l 2 ~ H (t — r ^~ l / 2 - H dr, according to the representation (6) 
we have 

Qt = f /4r(ds)&(w a ) 







and therefore, by the change of variables r = s/t, 
(25) 9l = [\ Udr) ^r) 



Vi'Jo hHy ' t» 
(26) = / n H {dr) 







t H 



where the last inequality is in distribution under P. 

If b has somewhat of a linear behavior, we can easily imagine that b(t H Lo r )/t H 
will be of the same order as b(u> r ). Therefore, Qt/Vt should behave, in distri- 
bution for fixed t, similarly to the universal random variable f fj,jj(dr)b(u r ) 
(whose distribution depends only on b and H). Generally speaking, if this 
random variable has a bounded density, the strongest version of condition 
(C), that is, with 7 = 0, will follow. In the linear case, of course, the factors 
t H disappear from expression (26), leaving a random variable which is in- 
deed known to have a bounded density, uniformly in t, by the arcsine law. 
The presence of the factor t lH in condition (C) gives even more flexibility, 
however, since, in particular, it allows a bound on the density of Qt/Vt to 
be proportional to t H . 



FRACTIONAL BROWNIAN MOTION 



11 



Leaving aside these vague considerations, we now give, in Proposition 2, 
a simple sufficient condition on b which implies condition (C). The proof of 
this condition uses the tools of the Malliavin calculus; as such, it requires 
some extra regularity on b. We also give a class of nonlinear examples of 
b's satisfying (C) [condition (28) in Lemma 3] which are more restricted in 
their global behavior than in Proposition 2, but do not require any sort of 
local regularity for b. 

Proposition 2. Assume H < 1/2. Assume that b' is bounded and that 
b" satisfies \b"(x)\ < h/(l + \xf) for some P S (H/(l — H), 1) . Assume that 
\b'\ is bounded below by a positive constant bo. Then, letting 7=1-/3, con- 
dition (C) holds. 

Remark 1. The condition 7 < 1/(1 + H) from condition (C) translates 
as (3 > H/ (1 — H), which is consistent with < 1 because H < 1/2. 

The nondegeneracy condition on \b'\ above can be relaxed. If, for x > xq, 
\b'(x)\ > x~ a holds, then condition (C) holds as long as a does not exceed a 
positive constant ao(H) depending only on H. We omit the very technical 
proof of this fact. 

The hypothesis of fractional power decay on b" , while crucial, does allow 
b to have a highly nonlinear behavior. Compare with Lemma 3 below, which 
would correspond to the case [3 = 1 here. 

The hypotheses of the above proposition imply that b is monotone. 

The proof of Proposition 2 requires a criterion from the Malliavin calculus, 
which we present here. The book [29] by D. Nualart is an excellent source 
for proofs of the results we quote. Here we will only need to use the following 
properties of the Malliavin derivative D with respect to W [recall that W is 
the standard Brownian motion used in the representation (3), i.e., defined 
in (5)]. For simplicity of notation, we assume that all times are bounded 
by T = 1. The operator D, from a subset of L?(Vt) into L 2 (0 x [0,1]), is 
essentially the only one which is consistent with the following two rules: 

1. Consider a centered random variable in the Gaussian space generated by 
W (first chaos); it can therefore be represented as Z = W( f) = Jq 1 f(s) dW(s) 
for some nonrandom function / G L 2 ([0, 1]). The operator D picks out the 
function /, in the sense that, for any r E [0, 1], 

D r Z = f(r). 

2. D is compatible with the chain rule, in the sense that, for any <!> G C 1 (R) 
such that both F := $(Z) and $'(Z) belongs to L 2 (0), for any r £ [0, 1], 

D r F = D r $(Z) = &(Z)D r Z = <P'(Z)f(r). 
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For instance, using these two rules, definition (3) and formula (5) relative 
to the fBm uj under P, we have that, under P, for any r < s, 



(27) D r b(t u u s ) =t H b'(t H Cj s )K(s,r). 

It is convenient to define the domain of D as the subset D 1,2 of r.v.'s 
F e L 2 (Q) such that D.F 6 L 2 (Q x [0, 1]). Denote the norm in L 2 ([0, 1]) by 
|| • || . The set D 1,2 forms a Hilbert space under the norm defined by 

||i^||? 2 = E|F| 2 + E||L>.F|| 2 = E|F| 2 + E \D r F\ 2 dr. 

Jo 

Similarly, we can define the second Malliavin derivative D 2 F as a member 
of L 2 (Q x [0, l] 2 ), using an iteration of two Malliavin derivatives, and its as- 
sociated Hilbert space D 2,2 . Non-Hilbert spaces, using powers other than 2, 
can also be defined. For instance, the space D 2,4 is that of random variables 
F having two Malliavin derivatives and satisfying 

||F|| 4 4 = E|F| 4 + E||D.F|| 2 + E||D 2 F|| 4 2([0jl]2) 

1 / rl \ 2 



E|F| 4 + E^ \D r F\ 2 dr + B^ \D r D s F\ 2 dr ds^j < 



oo. 



We also note that the so-called Ornstein-Uhlenbeck operator L acts as fol- 
lows (see [29], Proposition 1.4.4): 

LF = L$(Z) = -Z&(Z) + $"{Z)\\f\\ 2 . 

We have the following Lemma, whose proof we omit because it follows from 
([29], Proposition 2.1.1 and Exercise 2.1.1). 

Lemma 2. Let F be a random variable in D 2 ' 4 such that 'E[\\DF\\' S ] < 
oo. Then F has a continuous and bounded density f given by 



f(x)=E 



-LF jDF®DF;D 2 F) L 2 m] 2 ) 
( F>x) \\\DF\\ 2 2 HDFH 4 



Proof of Proposition 2. Using (26), and the notation /U = /xf , let 
F = 9l-(\ {dr) b ^) 



Vt Jo r " ' t H ■ 

It is sufficient to prove that F has a density which is bounded by Kf,t lH , 
where the constant Kf, depends only on b and H. Indeed, P[\Qt{oj)\/\^t < 
e] < Jo Kbt lH dx = et lH K},. In this proof, Cb,H denotes a constant depending 
only on b and H, whose value may change from line to line. 
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Step 1: calculating the terms in Lemma 2. We begin with the calculation 
of DF. Since the Malliavin derivative is linear, we get D r F = t~ H Jq fi(ds) x 
D r (b(t H (Jj s )). Then from (27) we get D r F = J^(ds)b'(t H ^ s )K(s,r). Thus, 
we can calculate 



\DF\ 



l 

dr 

1 r l 



JO 
1 r l 



fi(ds)b'{t H Cb s )K(s,r) 
n{ds)n{ds')b'{t H Cj s )b'(t H £j s 



min(s,s') 



K(s, r)K(s, r') dr 



o Jo 



n(ds)ii(ds')b'(t H Lu s )b'(t H u; s/ )R(s,s'), 



where R is the covariance of fBm in (2). A similar calculation yields 

n(ds)b"(t H Q s )K(s, r)K(s, q) 



D^F = t H 



9i r 



max(g,r) 



and 



\ d2f \\l 2 ([o,i] 2 ) 



t 
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1 rl 



JO 



li(ds)n{ds')b"(t H u s )b"{t M £o s ,)\R{s,s')\ 



lit +Hr. 



7M2 



For the Ornstein-Uhlenbeck operator, which is also linear, we get 



-LF 



Li{ds){b'(t H Cj s )u s + b"(t tl Co s )t n s zn ) 



nt4.Hr. \+H 2H\ 







Step 2: estimating the terms in Lemma 2. With the expressions in the 
previous step, using the hypotheses of the proposition, we now obtain for 
some constant Ch depending only on H, 



E[\LF\]<C H Halloo +*% / Kds)s 2H E 



1 



C H [ ||6'||oo+t%E 



fi(ds)s 



2H 



l + tPH\u s \P 
1 



l + {ts)P H \Zf_ 

where Z is a generic standard normal random variable. We deal first with 
the integral in s. For s G [0,1/2], fi(ds) has a bounded density, and thus, 



for any a > and any a < 1, we have J Q 1//2 ds(l + as a ) 1 < (1 — a) 
the other hand, for s G [1/2, 1], we can bound (1 + (ts^lZfy 1 above by 
(t/2)~ /3H \Z\~ l3 ~ 1 . We immediately obtain, using a = t l3H \Z\ 13 and a = j3H, 



on 



E[|LF|] <CW 1 + t 



ifr(i-j8). 



<CWl+t 



£T(1-/J)> 
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The estimation of ||.D 2 -F|| is similar. Using its expression in the previous 
step, the measure d/i/ds-dfi/ds' -R(s, s') and the fact that J ds(l + as a )~ 2 < 
(1 - 2a)- l a- 2 , with a = /3H < 1/2, we get 

E||D 2 F|| L 2 {[0il]2) <a,^ {1 - /3) . 

Also, almost surely, for any p>2, for some constant Cjj,p depending only 
on H and p, since b' has a constant sign, we obtain 

1 / f \ ~ p l 2 

= i a p(ds)fi(ds')R(s, s')\b'(t H u s )b'(t H u s ,)\) < C H , p b p . 

Last, it is convenient to invoke the Cauchy-Schwarz inequality to get 
{DF®DF-D 2 F) LH%1?) < ||ZJ 2 F|| L2([0il]2) 
\\DF\\ A ~ \\DF\\ 2 

Step 3: applying Lemma 2; conclusion. The third estimate in the previ- 
ous step (for p = 8) proves trivially that EUDi 7 !! -8 is finite. That F £ D 2 ' 4 
follows again trivially from the boundedness of b' and b" using the expres- 
sions in Step 1. Thus, Lemma 2 applies. We conclude from the estimates in 
the previous step that F has a density / which is bounded as 

f{x)<C H ,{l + t H ^))bf. 

With t > 1, the conclusion of the proposition follows. □ 

A smaller class of functions b satisfying condition (C) and covering all 
H G (0, 1) is given in the following, proved in the extended version of this 
article on arXiv.org. 

Lemma 3. Let H G (0,1). Assume xb(x) has a constant sign for all 
x € R+ and a constant sign for all x G R_. Assume 

(28) \b(x)/x\=c + h(x) 

for all x, where c is a fixed positive constant and limz^oo h(x) = 0. Then 
condition (C) is satisfied with 7 = 0. 

Condition (C) also holds for any b of the above form to which a constant 
C is added: \(b(x) — C)/x\ = c + h(x) and lirn^oo h(x) = 0. Note that this 
condition is less restrictive than saying b is asymptotically affine, since it 
covers the family b(x) = C + cx + (\x\ A 1) Q for any a G (0, 1). In some sense, 
condition (C) with 7 = appears to be morally equivalent to this class of 
functions. 



Proof of Theorem 2. 
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Step 1: setup. Since we only want to show that (24) holds, and since 
Jo \Qs\ 2 ds is increasing, it is sufficient to satisfy condition (24) for t tending 
to infinity along a sequence (£ n )neN- We write, according to (6), for each 
fixed t > 0, 



I t = I t (X):= \Q s (X)\ 2 ds = / fi s H (dr)b(X r 
Jo Jo Jo 



2 

ds, 



where X is the solution of the Langevin equation (7) and the positive mea- 
sure ii s H is defined by ^{dr) = (r / s) l l 2 ~ H (s — r)~ l l 2 ~ H dr. Since the the- 
orem's conclusion is an almost sure statement about X, and from the Gir- 
sanov Theorem 1 applied to X the measures P and P are equivalent, it is 
sufficient to work under P, that is to assume that X = Co is a standard fBm; 
we omit the dependence of Q s and It on X = Co for simplicity. 

We will show that with t n = n k for some positive integer k chosen below, 
It n converges to oo, by restricting the integration defining If n to a small 
interval of length b n = n~ 3 near t n , where j will be another integer chosen 
below. Indeed, 



fin fin 

hn= / \Qs\ 2 ds> / \Q s \ 2 ds 

JO Jtn—bn 

> b n \Q t J 2 ~ t \Qtn - Qs\\Qt n + Qs\ ds 

(29) >b n (\Q tn \ 2 - sup IQ^-Q.IIQ^ + Q.l) 

> b n \Q t J 2 - 2b n sup \Q S \ sup \Qt n ~Qs 

se[0,tn\ s£[t n -b n ,tn] 

■ = A n — B n . 



Step 2: the diverging term A n . The term A n is easily shown to converge 
to infinity almost surely thanks to condition (C) modulo a condition on j 
and k. Indeed, we have, for any sequence a n , 

F[bn\QtJ 2 < a 2 n ] < K h tl H a n {b n t n y l l 2 = K h n k ^ H -W+il 2 a n . 

In order for the right-hand side of the last expression to be summable in n 
while being able to choose lim^^oo a n = 0, it is sufficient to impose 

(30) j + 2 < (1 - 2-/H)k. 

Thus, specifically, under this condition, with = n e for < I < (1 — 2^/H)k — 
j — 2, we have by the Borel-Cantelli lemma the existence of an almost surely 
finite no such that n > no implies A n = b n \Q t „\ 2 > n ■ 
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Step 3: the error term B n . To control the term B n , we need two esti- 
mates, whose proofs are based on elementary facts on the trajectories of fBm, 
namely, boundedness and Holder-continuity, which are derived from that 
process's Gaussian and scaling properties. Details are in this article's ex- 
tended arXiv.org version. For any sequences b n ,t n such that b n <C t n (mean- 
ing \\m. n b n /t n = 0), for any fixed e > 0, there exists an almost-surely finite 
random variable c e such that, for all n > 0, we have 

(31) sup \Q 8 \ < c £ (tn) 1/2+e 

se[o,t n ] 

and 

(32) sup \Q t - Q s \ < c £ (tn) 1/2 - H+£ (b n ) H - £ . 

S,t&[t n —b n ,tn] 

One immediately obtains that almost-surely 

B n < 2c%(t n )V^(t n )V 2 - H+ t(b n ) H -c = 2 c 2^(i-^2 £ )- j( i + h- £ ) 

The statement that B n converges to (i.e., that e can be chosen positive 
while having the power in the last expression above be negative) now follows 
from assuming that j and k are related by 

(33) k(l-H)-j(l + H) <0. 

Step 4: conclusion. We may now use the results of the last two parts, 
namely, that A n > n while lmin^oo B n = 0, to conclude from (29) that the 
statement of the theorem holds, provided that conditions (30) and (33) hold, 
that is, 

<k< -J- 



1 - 1 - H" 

The theorem now follows because the relation 7 < 1/(1 + H) in condition 

j+2 ^ l+H , 
1-2-yH ^ 1~H- 



(C) implies that if j is large enough, we do indeed have -,^j~ 2 „ < jzM-j- O 



4.2. The case H > 1/2. Due to the fact that the function Q is less regular 
in this case, we should not expect the proof of the following theorem to be 
a consequence of the proof in the case H < 1/2. Nevertheless, it deviates 
from the former proof very little. On the other hand, we cannot rely on 
Proposition 2 to find a convenient sufficient condition for condition (C); 
instead, we can look to the nonlinear class of examples in Lemma 3, which 
satisfy the strong version (7 = 0) of condition (C) for all H £ (0,1). The 
next result's proof is in the Appendix. 

Theorem 3. Assume that H > 1/2 and b satisfies condition (C) with 
7 = [e.g., b satisfies condition (28) in Lemma 3]. Then the maximum 
likelihood estimator 6t is strongly consistent. 
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5. The linear case. In this section we present some comments in the 
case when the drift b is linear. We will assume that b(x) = x to simplify the 
presentation. In this case, the solution X to equation (7) is the fractional 
Ornstein-Uhlenbeck process and it is possible to prove more precise results 
concerning the asymptotic behavior of the maximum likelihood estimator. 

Remark 2. In [9] it is shown that, for any H S (0, 1), there is a unique 
almost surely continuous process X satisfying (7), and it can be represented 
as 

(34) X t = f e^- u UB^ te[0,T], 

Jo 

where the above integral is a Wiener integral with respect to B H (which 
exists also as a pathwise Riemann-Stieltjes integral). It follows from the 
stationarity of the increments of B H that X is stationary and the decay 
of its autocovariance function is like a power function. The process X is 
ergodic, and for H > 1/2 it exhibits long-range dependence. 

Let us briefly recall the method employed in [18] to estimate the drift 
parameter of the fractional OU process. Let us consider the function, for 
0<s<t<l, 

k(t, s) = cjjh 1 / 2 - 11 ^ - s) 1 / 2 - 11 with c H = 2#T(f - H)T(H + ±) 

(35) 

and let us denote its Wiener integral with respect to B by 

(36) M t H = [ t k(t,s)dB?. 

Jo 

It has been proved in [28] that M H is a Gaussian martingale with bracket 

(37) (M*),:^^- mt ^ H ^JE^M±m. 

The authors called M H the fundamental martingale associated to fBm. 
Therefore, observing the process X given by (7) is the same thing as ob- 
serving the process 

Zf B = f t k(t,s)dX s , 
Jo 

which is actually a semimartingale with the decomposition 

(38) Z t KB = e rQ* B <Lf + M*, 

Jo 

where 

(39) Q? B = ^h j*k(t,s)X 8 ds, te[0,T]. 
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By using Girsanov's theorem (see [18] and [28]), we obtain that the MLE is 
given by 

ft n KB j 7 kb 

(40) 9 t :=ef B - JoQs dZs 



Remark 3. We can observe that our operator (12) or (23) coincides 
(possibly up to a multiplicative constant) with the one used in [18] and 
given by (40). Assume that H < 1/2; the case H > 1/2 is just a little more 
technical. 



PROOF. Using relations (11) and (35), we can write 

Qt = C{H)t H - 1 ' 2 r sV 2 - H (t - a )-^ 2 - H b(X a ) ds 
Jo 

= C(H)t H ~ 1 / 2 f ^-k(t,s)b(X s )ds 
Jo at 

= C{H)t H ~ 1 ' 2 ^- f k(t,s)b(X s )ds. 
at Jo 

It is not difficult to see that f t J* k(t, s)b(X 3 ) ds = C^t 1 ' 211 Qf B and 
therefore, 

(41) Q t = C{H)t l ' 2 - H Qf B . 
On the other hand, it can be similarly seen that 

(42) Zf B = C{R) f s 1/2 ~ H dZ s 

Jo 

and the estimations given by (40) and (23) coincide up to a constant. □ 

To compute the expression of the bias and of the mean square error and 
to prove the strong consistency of the estimator, one has the option, in this 
explicit linear situation, to compute the Laplace transform of the quantity 
Jo(Qf B ) 2 dwf. This is done for H > 1/2 in [18], Section 3.2, and the follow- 
ing properties are obtained: 

• the estimator Bt is strongly consistent, that is, 

Qt — > 6 almost surely when t — > oo; 

• the bias and the mean square error are given by: 
- If 9 < 0, when t — > oo, then 

(43) E (0 t -0)-p B(9 t -9) 2 ^^\9\; 
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- If 6 > 0, when t — > oo, then 

(44) E(0 t - 0) - -2Vvrsin7rif6» 3/2 e~ e *^t, 

(45) E(0 t - #) 2 - 2VTTsmTrHe 5/2 e- et Vi. 

It is interesting to realize that the rate of convergence of the bias and of 
the mean square error does not depend on H . In fact, the only difference 
between the classical case (see [26]) and the fractional case is the presence 
of the constant \J irH in (43), (44) and (45). It is natural to expect the same 
results if H < 1/2. This is true, as stated next, and proved in Section A. 2. 

Proposition 3. If H < 1/2, then (43), (44) and (45) hold. 

6. Discretization. In this last section we present a discretization result 
which allows the implementation of an MLE for an fBm-driven stochastic 
differential equation. 

We first provide background information on discretely observed diffusion 
processes in the classical situation when the driving noise is the standard 
Brownian motion. Assume that 

dX t = b(X t ,0) + a(X t ,0)dW t , 

where a, b are known functions, W is a standard Wiener process and 6 is the 
unknown parameter. If continuous information is available, the parameter 
estimation by using the maximum likelihood method is somewhat simpler; 
indeed, the maximum likelihood function can be obtained by means of the 
standard Girsanov theorem and there are results on the asymptotic behavior 
of the estimator (consistency, efficiency etc.). We refer to the monographs 
[4, 34] or [24] for complete expositions of this topic. 

"Real- world" data is, however, typically discretely sampled (e.g., stock 
prices collected once a day or, at best, at every tick). Therefore, statistical 
inference for discretely observed diffusions is of great interest for practi- 
cal purposes and at the same time it poses a challenging problem. Here 
the main obstacle is the fact that discrete-time transition functions are not 
known analytically and consequently, the likelihood function is in general 
not tractable. In this situation there are alternative methods to treat the 
problem. Among these methods, we refer to numerical approximation to 
the likelihood function (see [1, 5, 32]), martingale estimating functions (see 
[6]), indirect statistical inference (see [16]) and Bayesian approaches (see 
[15]). We refer to [36] for a survey of methods of estimation in the dis- 
crete case. When the transition functions of the diffusion X are known, 
and o~(x,6) = ax with a unknown and not depending on 9, then Dacunha- 
Castelle and Florens-Zmirou [12] propose a maximum likelihood estimator 
which is strongly consistent for the pair (6,o~). They also give a measure of 
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the loss of information due to the discretization as a function depending on 
the interval between two observations. 

A more particular situation is the case when a is known (assume that a = 
1). Then the maximum likelihood function, given by exp(# Jq6(X s ) dX s — 

Tj- Jq6(X s ) 2 ds), can been approximated using Riemann sums as 

/ N-1 q2 N-1 \ 

exp £ b(X ti )(X ti+1 -X ti )--J2 b(X u f{U +1 -U)\. 
\ i=o z i=0 J 

As a consequence, the following maximum likelihood estimator can be ob- 
tained from the discrete observations of the process X at times to, . . . , in 
a fixed interval [0, T], with discrete mesh size decreasing to as N — > oo: 

(46) 0at,t- ^o^P^X^!-^) 



(see [25], including proof of convergence to the continuous MLE). 

In the fractional case, we are aware of no such results. We propose a 
first concrete solution to the problem. We choose to work with the formula 
(23) by replacing the stochastic integral in the numerator and the Riemann 
integral in the denominator by their corresponding approximate Riemann 
sums, using discrete integer time. Specifically, we define, for any integer 
n> 1, 

r^j\ q ._ Em=0 Qm{Z m +l ~ Z m ) 



J2m=0 \ Q 



in 



2 



Our goal in this section is to prove that 9 n is in fact a consistent estimator 
for 6. By our Theorems 2 and 3, it is of course sufficient to prove that 
]xm n -n X> (0 n — n ) = almost surely. One could also consider the question of 
the discretization of 0? using a fine time mesh for fixed T, and show that 
this discretization converges almost surely to 0t ; by time-scaling, such a goal 
is actually equivalent to our own. 

It is crucial to note that in the fractional case the process Q given by 
(11) depends continuously on X and, therefore, the discrete observation 
of X does not allow one to obtain directly the discrete observation of Q. 
We explain how to remedy this issue: Q m appearing in (47) can be easily 
approximated if we know the values of X n ,n > 1, since only a deterministic 
integral appears in the expression of (11); indeed, for H < 1/2, the quantity 



n-1 



(48) Q n = c(H)n H -^Y,( n -3)' H ~ 1/2 J 1/2 ~ H b(X 3 ) 

can be deduced from observations and it holds that \im n (Q n — Q n ) = 
almost surely. This last fact requires proof, which is simpler than the proof 
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of convergence of 9 n — 9 n to 0, but still warrants care; we present the crucial 
estimates of this proof in Section A.3.1. 

Note, moreover, that calculation of 9 n also relies on Z m , which is not 
observable; yet from formula (21), where Z m is expressed as a stochastic 
integral of a deterministic function against the increments of X , again, we 
may replace all the Z m 's by their Riemann sum; proving that these sums 
converge to the Z^s follows from calculations which are easier than those 
presented in Section A.3.1, because they only require discretizing the de- 
terministic integrand. We summarize this discussion in the following precise 
statement, referring to Section A.3.1 for indications of its proof. 

Proposition 4. With Q n as in (48) and Z n = Ej=o(^*' -ll [o,n] (0)(i) x 
(Xj + i — Xj), then almost surely 9 n — 9 n converges to 0, where 9 n is given 
by (47) with Z and Q replaced by Z and Q. 

Let {M) n denote the quadratic variation at time n of a square- integr able 
martingale M. We introduce the two semimartingales 

(49) A t := f Q s dZ s , 

Jo 

(50) B t := J Q [a] dZ„ 

where [s] denotes the integer part of s. We clearly have B n = X)^=o Qm(Z m +i - 
Z m ). Thus, using the fact that Z is a Brownian motion under P, we see that 



(51) (B) n =J2\Q 
while 



n-l 

2 

m | i 

m=0 



n-l 



(52) (A-B) n = \Q s -Q[s]\ 2 ds=J2 \Q S -Q m \ 2 ds. 

J ° m=0 Jm 

Therefore, from definitions (12) and (47), we immediately get the expressions 

A n . - B n 



and 9 n 



" (A) n n {B) n 

The following proposition defines a strategy for proving that 9 n - and, by 
the previous proposition, 9 n - is a consistent estimator for 9. See Section 
A. 3. 2 for its proof. 



Proposition 5. Let H e (0,1). // there exists a constant a > such 
that 
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• n a {A — B) n /{B) n is bounded almost surely for n large enough, 

• for all k>l, for some constant K > 0, almost surely, for large n, (-B}„ > 



The following theorem, proved in Section A. 3. 3 under the condition (C) 
below, which is stronger than (C), still allows for nonlinear examples. 

Theorem 4. Assume b' is bounded and the following condition holds: 

(C 7 ) There exist constant to, Kb > depending only on H and b, such that, 
for all t>to and all e > 0, P[\Qt(oj)\/Vt < s] < eKt,, where under P, 
Cj has the law of fBm with parameter H. 

Then for all H £ (0,1/2), almost surely lim n _K 30 $ ri = 9, where the dis- 
cretization 9 n of the maximum likelihood estimator 9 n is defined in (47) . If 
H £ (1/2,1), the same conclusion holds if we assume in addition that b" is 
bounded. 

By Proposition 4, the above statements hold with 9 replaced by 9. 

Remark 4. Condition (C) holds as soon as the random variable Qt{&) / \fi 
has a density that is bounded uniformly t. When H < 1 /2, this is a statement 
about the random variables Jq 1 {ds)b{t H uj s )t~ H . In all cases, condition 
(C) holds for the class of nonlinear functions defined in Lemma 3. 

We conjecture that Theorem 4 holds if we replace (C) by (C), in view, 
for example, of the fact that the conditions of Proposition 5 hold for any 
a < 2H. Step 1 in the theorem's proof is the obstacle to us establishing this. 



A.l. Proof of Theorem 3. Recall from the proof of Proposition 1 that 
we can write 



We note that in this case the expression ^(dr) does not determine a mea- 
sure, but we still use this notation to simplify the presentation; the Lipschitz 
assumption on b and the Holder property of X do ensure the existence of 
the integral. 

One can actually follow the proof in the case H < 1/2 line-by-line. All 
we have to do here is prove an equivalent of relations (31) and (32) on the 
supremum and variations of Q, this being the only point where the form 
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(53) 
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of Q, which differs depending on whether H is greater or less than 1/2, is 
used. We briefly indicate how the second summand of Q in (53) (which is 
the most difficult to handle) can be treated. 

Under P, we denote X by Q, since its law is that of standard fBm. The 
quantity Q' t := / * f4j(dr)(b(X t )-b(X r )) equals $ fi 1 H (dr)t- H (b(t H uj 1 )-b(t H Cj ] 
in distribution. Now, let V( := t~ l l 2 Q' t . Omitting the details, we state that 
instead of relations (31) and (32) on Q' , it is equivalent to show the following 
bound for some M > 2: 

- r 1 fb \ HM 

(54) E sup \V(-V^\ M <C M>H Jf) , 

which follows from some elementary calculations, and a standard application 
of the Kolmogorov lemma on continuity (see [35], Theorem 1.2.1). Details 
are in the extended version of this article on arXiv. A direct proof, via (31) 
and (32), not via (54), is also possible. 

A. 2. Proof of Proposition 3. To avoid tedious calculations with frac- 
tional integrals and derivatives, we will take advantage of the calculations 
performed in [18] when H > 1/2; nevertheless, we believe that a direct proof 
is also possible. Actually, the only step where the authors of [18] use the fact 
that H is greater than 1/2 is the computation of the process Q. By relations 
(20) and (22), we can write 

Qt = J t J* j\K*>- x l m {-)){s)K{s,v)dsdZ v . 

Note that from the formulas presented in Section 2, we have 

{K^l m {-)){s) = c(H)s^ f s n^ H {u - s)- H - 1/2 , H <\, 

(^- 1 l M (0)( S ) = C (^) 5 1/2 - H ^/^ 1/2 - lf (^-^)^ +1/2 , H>\. 
To unify the notation, we write 

(K*'-H m (.))(s) = c(H)s^ H ±- s f s U^- H {u - 8 )- B +V\ H € (0, 1), 

and we just observe that the constant c{H) above is analytic with respect 
to H. Let us consider, for v < t, a function A(v,t) such that 

^ A(v, s) ds = /V*' -1 !^-))^)^ v) ^. 

Jv 

Then, obviously, Qt = Jq A(t,v) dZ v . 

On the other hand, it has been proved in [18] [see relations (3.4) and (3.5) 
therein] that, for H > 1/2, Qf B = /* A KB (t,v) dZ* B with A KB (t,s) = 
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c(H)(t 2H 1 + s 2H 1 ). Using the relations between Q and Q KB and between 
Z and Z KB (see Remark 3), it follows that, for every H > 1/2, and s <t, 



(55) A(s,t) = c(H) 



1/2-H / + \ 1/2-H 
+ 



S 

tj \s 



We show that the above relation (55) is true for H < 1/2 as well. We use 
an argument inspired by [13], proof of Theorem 3.1. We observe that the 
functions 



H E (0, 1) — > A(s,t) and H E (0, 1) — > c(H) 



1/2-H / + \ 1/2-H 



S 

1 J ' \s 



are analytic with respect to H and coincide on (1/2, 1). Moreover, both are 
well-defined for every H E (0, 1) (in fact, it follows from [18] that A is well 
defined for H > 1/2 and it is more regular for H < 1/2). To conclude (55) for 
every H E (0, 1), we invoke the fact that if /, g: (a,b) — > M. are two analytic 
functions and the set {x E (a, 6); f(x) = g(x)} has an accumulation point in 
(a, b), then f = g. 

As a consequence, (55) holds for every H E (0, 1) and this shows that 

I* Q s dZ s = I* Qf B dZf B = c(H) (z? B I* r™" 1 dZ« B - t 
Jo Jo V Jo 

and all calculations contained in [18], Sections 3.2, 4 and 5 hold for every 
H. □ 

A.3. Proof of Theorem 4. 

A. 3.1. Proof of Proposition 4. For conciseness, we only indicate how to 
establish one of the crucial estimates for this proposition, that the quantity 

q J2m=o(Qm — Qm){Z m +l ~ Z m ) 

converges to almost surely, and then only for H < 1/2. Since we want to 
show that S n tends to almost surely, and P and P share the same null 
sets, we may assume that Z is a Brownian motion, and A is a fractional 
Brownian motion adapted to Z : s filtration. 
Define the quantity 

Rn = X> " j)- H - 1/2 j 1/2 - H f + Wi) " b{X s ))ds. 

This is related to S n via the fact that m H ~ 1 ^ 2 \R m \ = Q m — Q m . We claim 
that, for any e > 0, almost surely, for large m, that is, m > mo, \R m \ < 
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ro + m" H+1+£ CH\\b'\\, where ro is a fixed random variable. This is suffi- 
cient to conclude that lim n S n = 0. Indeed, we will see below [Section A. 3. 3, 
Step 2, inequality (61)] that, almost surely, for large n, J2m=o\Qm\ 2 > n 2 ■ 
Then the sum of all terms in the numerator of S n for m < mo, after having 
been divided by S n 's denominator, tends to when n — > oo. On the other 
hand, the IID terms {Z m+ \ — Z m } mg N are standard normal, so that one 
trivially proves that almost surely for n > uiq (abusively using the same 
mo as above), up to some nonrandom universal constant c, \Z m+ \ — Z m \ < 
c-y/logm. It follows that the portion of S n for m > mo is bounded above 
by n~ 2 J2m=o m^" 1 / 2 (ro + m~ i ^ +1+e c^||6 / ||)-v/logm, which is itself bounded 
above by (ro + ch ||&'||)n 3 / 2+£ , which obviously tends to as n — > oo as soon 
as e < 1/2. 

Now let us prove our claim on R m . It is a known fact, which is obtained us- 
ing standard tools from Gaussian analysis, or simply the Kolmogorov lemma 
(see [35], Theorem 1.2.1), that, for any M > 1, 



E 



sup \X t -X s \ M 
s,te\j,j+i] 



<J HM - 



The usual application of the Borel-Cantelli lemma after Chebyshev's in- 
equality for an M large enough implies that, for any a> H, almost surely, 
for large j, sup^g^ \X t — X s \ < j a . Consequently, for any e > 0, 



mo r j+l 

\R m \<2\\b'\\Y^{n-3)- H - ll2 3 l/2 - H \Xj-Xs\a 

3=0 Jj 



n-1 

+ W\\ z2( n -jT H - 1/2 3 1/2 - H sup \Xj-X s \ 
j=m se\j,j+i) 

ro + n-^llb'llJ^il-j/nr^U/n) 1 / 2 - 11 sup \X S - X. 
j=m 

n— 1 



j=m 

= r + c H n- H+£+1 (l + O(l/n)), 

a T?iomQnn oniric fni* 



where the last estimate is by virtue of the Riemann sums for Jq (1 ^ x (h+i/2 dx. 



A. 3. 2. Proof of Proposition 5. By our Theorems 2 and 3, it is of course 
sufficient to prove that lim n ^oo(^ n — n ) = 0. In preparation for this, we 
first note that by classical properties for quadratic variations, and using our 
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hypothesis, for large enough n we have 

\(B) n -(A) n \ = \((B-A),(B + A)) n \ 

(56) <\ {B + A) n \ l l 2 \{B-A) n \ l l 2 

^V^n^KBU^KA^ + iBU 1 / 2 . 
Now we prove that (56) implies, almost surely, 

(57) >^7# i = L 

\ B )n 



n— >oo 



Indeed, let x n = (A) n /(B) n . Then we can write 

\(B) n -(A) n \ 
lXn 11 " (B) n 

<V2n- a \{B) n \- l l 2 \{A) n + {B) n \ l l 2 

= cV2n~ a \l + x n \ 1 / 2 , 

where c is a possibly random almost surely finite constant. Let e > be 
given; it is elementary to check that the inequality (x — l) 2 < 2e(x + 1) 
is equivalent to \x — (1 + e)| < y/4e + e 2 . For us this implies immediately 
\%n ~ 1| — 6cn~ a , proving the claim (57). 
Now we have 

(KH\ ft - ft - An B ' a - An ~ Bn J- A (- S )" ~ (tyn 

{ j n n ~(A) n (B) n ~ (B) n +An (A) n (B) n - 

Using (56), we have that the second term in (58) is bounded above in abso- 
lute value by 

■ /n n -« A - IWn+jg)^ _ r 2n - a A^ f (A)n \ ^ 

V ' n (A) n \(B) n \^ " V2n (A) n {(B) n +1 ) ■ 

By Theorems 2 and 3, A n j (A) n converges to the finite constant 9. By the 
limit (57), the last term in the above expression converges to 2, so that the 
entire expression converges to 0. Let k and 7 be fixed positive values. For the 
first term in (58), using our hypotheses, by the Chebyshev and Burkholder- 
Davis-Gundy inequalities, and from the expression of the semimartingales 
Z as Z t = Jq Q s dW s + 6 J * Q s ds, we have 

P[|A» " B n \ k > n-*TE[(J3)*]] < n^E- 1 [( J B)^]E[|A„ - B n \ k ] 

< c(Q)2 k n ik n~ ka . 

Thus, picking a positive value 7 < a and choosing k large enough, by the 
Borel-Cantelli lemma, almost surely, for n large enough, 

\A n - B n \ < n-^[(B) k n ] llk < ^n^(B) n , 
which finishes the proof of the proposition. 
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A. 3. 3. Proof of Theorem 4 (Steps 1 through 4). In this entire proof Hq(uj) 
will denote a random almost surely finite integer; it may change from line 
to line, as it is introduced via various different applications of the Borel- 
Cantelli lemma, but one only needs to take the supremum of all such integers 
to have correct statements throughout. 

Step 0. Strategy. First note that since the probability measures P and P 
are equivalent (see Theorem 1), almost sure statements under one measure 
are equivalent to statements about the same stochastic processes under the 
other measure, and therefore, we may prove the statements in the theorem 
by assuming that the process Z in the definitions (49) and (50) is a standard 
Brownian motion, since such is its law under P. Furthermore, for the same 
reason, we can assume that, in these same definitions, Q is given by formula 
(11), where X is replaced by Co whose law is that of standard fBm. We will 
use specifically, instead of (11), the explicit formula (25) when H < 1/2. For 
H > 1/2, the formula (16) must be used instead, which shows the need for 
a control of 6's second derivative. For the sake of conciseness, we restrict 
our proof to the case H < 1/2. The result of the theorem is established as 
soon as one can verify the hypotheses of Proposition 5. Here we present 
only the proof of the first of the three hypotheses. The other two are proved 
using similar or simpler techniques. To achieve our goal in this proof, it is 
thus sufficient to prove that, almost surely, for large n, {B) n > n ai , while 
(A — B) n < n" 2 , where the values a\ and «2 are nonrandom and a\ > ct2- 
We establish these estimates in the Appendix. 

Step 1. Bounding \Q\ 2 below. Using only condition (C), we immediately 
get, for any 7 G (0, 1/2 — H), for any large t, 



To be able to apply the Borel-Cantelli lemma, we now let t = n , where n 
is an integer and A is a constant exceeding 7 _1 . We then get, almost surely, 
for any n > uq(uj), 



We also bound other Q m i 's that are in close proximity to Q n A ■ For any fixed 
integer m, consider the set Ifh of integers ml in the interval [fh — fh 4,y ,fh], 
where 7 is also assumed to be less than 1/4. Then by inequality (32), for 
any e > 0, for some almost-surely finite r.v. c e , 



nQt\<t l/2 ^}<K b t-^. 



(59) 



\Qn A I > n 



A(l/2- 7 ) 



sup \Q. 



■in 



Qm' I < c £ m 



1/2-H+e^ 



H-e 



rn' 



= c £ m 



l/ 2 -(tf- £ )(l-4 7 ) 



= c £ m 



1/2-e' 
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where e' = (H — e)(l — 47) is positive for e < H and our hypothesis on 7. 
Thus with < e' < H{1 — 47), almost surely, we may write that 

\Qm'\ > \Qm\ -c £ fh 1/2 - £ '. 

Certainly, if fh is of the form n A for large enough n, by choosing 7 small 
enough, we obtain that the lower bound m 1 / 2 ^ 7 on |Qm| 2 obtained in (59) 
is dominant compared to m}l 2 ~ e for e close to H(l — 47). Hence, we get, 
almost surely, with fh = n A large enough, for all ml E [fh — m 47 ,fh], 

(60) |Q^| 2 >IQ^I 2 (i- |Q _! j >l^l 2 /2- 

S'tep 2. Bounding (B) from below. For n given, let n\ be the largest 
integer such that n A <n< (m + l)" 4 . Also assume n is large enough so that 
n A > re o( w )- Thus, applying (60) with fh = n A , 



(B) m >J2\Qm.\ 2 >\Q n A\ 2 + \Q™>\ 2 

>\Q nA \ 2 + 2- 1 J2 \Qn-\ 2 >\Qn-\ 2 (n^. 
m'=nf-(nf 

We can now invoke (59) to say that, almost surely, for n > tiq{uj) a , 

(B) n > (n!) A (2- 27 )(<) 47 = 2- l ( ni ) A ^\ 

Given that we may write n A (l + n^ 1 ) > n, so that n A > n/2, we can finally 
conclude that 

,2+2-y 



( 61 ) > 2l+2A(1+7) 



n 



Step 3. Bounding (A—B)'s terms from above. We may generically bound 
the general term of (A — B) n directly using the bound (32) and its associated 
random variable c e : almost surely, for all m > 0, 

rm+l rm+l 

/ \Qs - Qm\ 2 ds < / ds SUp \Qt-Qm\ 2 

Jm Jm te[m,m+l] 

(62) = sup \Q t -Q m \ 2 

t£[m,m+l] 

< c £ {m l l 2 - H+£ l H - £ ) 2 = m 1 - 2 ^. 
We conclude that, for any 5 = 1 — e < H, almost surely, for all m, 

rm+l 

/ IQs-Qmfds^Cem 1 - 25 . 
J m 
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Step 4. Conclusion. From the formula (A — B) n = J2m=oIm +1 \Qs ~ 
Q m \ 2 ds, using the last estimate of the previous step, we get 

(A-B) n <c £ n 2 - 2S . 

From the final estimate (61) of Step 2, we may now write almost surely 

(A - B) n c e 
{B) n ~ n 2 (~f+ s ) ' 

Hence, the first statement of Proposition 5 is established for any a < 2H. 
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